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We describe our ongoing project of formalization of algebraic methods for geometry theorem proving 
(Wu's method and the Grobner bases method), their implementation and integration in educational 
tools. The project includes formal verification of the algebraic methods within Isabelle/HOL proof 
assistant and development of a new, open-source Java implementation of the algebraic methods. The 
project should fill-in some gaps still existing in this area (e.g., the lack of formal links between al- 
gebraic methods and synthetic geometry and the lack of self-contained implementations of algebraic 
methods suitable for integration with dynamic geometry tools) and should enable new applications 
of theorem proving in education. 



1 Introduction 



The field of automated deduction in geometry has been very successful in the last several decades and 
a number of geometry automated theorem provers (GATP) have been developed. Most successful of 
these are based on algebraic methods, primarily Wu's method B31 and the method of Grobner bases 10. 
The algebraic methods require expressing geometric properties as polynomial equations in the Cartesian 
plane, and then using algebraic techniques for dealing with these equations. There are several implemen- 
tations of these methods and hundreds of complex geometry theorems have been proved automatically 
by them ifTTTl . However, despite these advances, there are still some gaps in this area, preventing wider 
applications of the algebraic methods for geometry, especially in education and in formal explorations of 
geometry. Some of these gaps are: 

• There are no free, self-cointained, open source, and well-documented implementations of GATPs 
based on algebraic methods, suitable for integration with other tools (e.g., dynamic geometry 
tools). 

• There is no support for automated theorem proving in dynamic geometry tools most widely used 
on all levels of mathematical education (e.g., GeoGebra |25l ) and this limits their applicability. 

• Only fragments of geometry and algebraic methods have been formalized within proof assistants 
(e.g., Isabelle, Coq), and there are still no formalized links between algebraic methods and syn- 
thetic geometry, giving formal correctness arguments for algebraic methods for geometry. 

We aim at integrating algebraic methods in geometry with dynamic geometry software (DGS), widely 
used in education, and in interactive proof assistants, used for formalizing geometry. One of our goals 
is to address formalization of algebraic methods and their integration into the interactive proof assistant 
Isabelle/HOL. A major concern when using algebraic methods is the lack of formally established connec- 
tions between proofs produced by these methods with classical synthetic proofs in geometry (i.e., with 
Hilbert's or Tarski's geometry). We aim to make a strong, formal link between algebraic methods and 
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synthetic geometry by formalizing both within Isabelle/HOL. Once the correctness of algebraic methods 
is formally established within a proof-assistant, these methods can be used as reflective tactics that help 
automating significant portions of proofs in exploring geometry in a formal setting. Our other goal is 
also to develop a corresponding Java implementations of Wu's method and the Grobner bases method 
conforming to open-source code and documentation standards. These should directly support current 
standardization initiatives for geometry formats (e.g., i2g, TGTP) and should be suitable for integration 
into DGS. Both these goals should enable new applications of (both automated and interactive) theorem 
provers in education and in formalizing mathematics. 

In this paper we report on the current status of our project. So far, our Isabelle formalization includes 
formalization of the Cartesian plane geometry, the translation of constructive geometry statements into 
algebraic form, and connections between the algebraic form of a statement and its interpretation in the 
Cartesian plane. Concerning Java implementation, we have implemented Wu's method. Before present- 
ing details on this developments, we give a brief account on the related work and on relevant background 
information. In all contexts, only plane (and not space) geometry is considered. 

Overview of the paper. In Section[2]we present related work and state-of-the-art results in the relevant 
areas; in Section [3] we briefly discuss algebraic methods and some common grounds for their formaliza- 
tion and implementation; in Section [4] we present the current status of our formalization of algebraic 
methods within Isabelle/HOL; in Section [5] we present our Java implementation of Wu's method; in 
Section [6] we discuss possible integrations of our developments within a wider context; in Section [7J we 
discuss potential applications in education, and, in Section[8]we draw some final conclusions. 



2 Related Work 

Dynamic Geometry Tools. Dynamic geometry tools are mathematical software tools that allow in- 
teractive work and visualization of geometric objects, by linking synthetic geometry (most often — 
Euclidean) with its standard models (e.g., Cartesian model). These tools are used in mathematical edu- 
cation, but some of them are also used for producing digital mathematical illustrations and animations. 
The common experience (not scientifically supported, but with a number of positive empiric cases stud- 
ies) is that dynamic geometry tools significantly help students to acquire knowledge about geometric 
objects. Some of the most popular dynamic geometry tools are GeoGebra^ Cinderella^ Geometer's 
Sketchpad^ Cabri^ GCLC^VEukleides^ An overview of interactive geometry tools and their features 
can be found on the Internet]!] 



Automated Theorem Proving in Geometry. Automated theorem proving in geometry has a history 
more than fifty years long lfT2l . In the 1950s Gelernter created a theorem prover that could find solutions 
to a number of problems taken from high-school textbooks in plane geometry [181- The biggest successes 
in automated theorem proving in geometry were achieved (i.e., the most complex theorems were proved) 
by algebraic provers based on Wu's method ifTTl l45ll and Grobner bases method [7J El. Instead of 
readable, traditional geometry proofs, these methods produce only a yes/no answer with a corresponding 

http : / /www . geogebra . org 
^http : //www. Cinderella. de 
"http : //www. keypress . com/ sketchpad/ 

http : //www. ca bri . com 

5 ht tp : //argo .matf .bg. ac .rs| 

f http : //www. eukleides . org 

1 http : / /en . wikipedia . org/ wiki/Interact ive_geometry_sof tware 
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algebraic argument. Coordinate-free methods, such as the area method |[T3ll27l and the full angle method 
EJHU, often produce readable proofs, but for many conjectures these methods still deal with extremely 
complex expressions involving certain geometry quantities. An approach based on a deductive database 
and forward chaining works over a suitably selected set of higher-order lemmas and can prove complex 
geometry theorems, but still has a smaller scope than algebraic provers [9]. Quaife used a resolution 
theorem prover to prove theorems in Tarski's geometry j38l . A theorem prover based on coherent-logic 
[2] can produce both readable and formal proofs of geometry conjectures of a certain sort Rot . 



Integration of Dynamic Geometry and Automated Theorem Proving. Just a few dynamic geome- 
try systems have support for automated theorem proving, typically based on Wu's method, the Grobner 
bases method, and the area method. Geometry Experj^] (GEX) is a dynamic geometry tool focused on 
automated theorem proving and it implements Wu's, the Grobner bases, the vector, the full-angle, and 
the area method ifTOl . MMP/Geometei[^]is a new, Chinese, version of GEX [ 17 ]. It automates geometry 
diagram generation, geometry theorem proving, and geometry theorem discovering. Java Geometry Ex- 
pert (JGEX) is a new, Java version of GEX [46]. It combines dynamic geometry, automated geometry 
theorem proving, and, as its most distinctive part, visual dynamic presentation of proofs. The systems 
from the GEX family are publicly available, but only JGEX is open-source. GEOTHEPp] a module 
of Epsilon, is implemented in Maple, with drawing routines implemented in Java ll43ll . Theorema^] is 
a general mathematical tool, implemented in Mathematica, with a uniform framework for computing, 
problem solving, and theorem proving 0. It has some features of dynamic geometry systems and has 
support for automated theorem proving in geometry ll37l . Discover is a system for automated proving 
and discovery in geometry, based on two computer algebra systems, CoCoA and Mathematica Q. Ge- 
ometry Explorer is a dynamic geometry tool proving support for the full-angle method for automated 
theorem proving fi4ll . GCL(J/^1 is a geometry tool, implemented in C++, for visualizing geometry, for 
producing mathematical illustration and for reasoning about geometry constructions. It is based on a 
custom geometry language and provides support for three methods for automated theorem proving ||26l . 
GeoProof is a dynamic geometry tool with built-in verified (within Coq) geometry theorem prover based 
on the area method ||3"3l . 



Formalizations of Geometry. There are a number of formalizations of fragments of various geome- 
tries within proof assistants. Parts of Hilbert's seminal book Foundations of Geometry have been for- 
malized in Isabelle/Isar [32] and in Coq ifTBll . Within Coq, there are also formalizations of von Plato's 
constructive geometry [29], French high school geometry [21], Tarski's geometry |[34l , ruler and com- 
pass geometry [16], projective geometry [31], etc. There are efforts to integrate Grobner bases solvers 
with proof-assistants |[8j [35], but only the general part, not the one dealing with geometry statements. 
Verified automated theorem proving in geometry based on certificates has been developed in Coq for 
the Grobner bases method [20] and for Wu's method [19]. In both these developments, certificates are 
generated using external provers. Also, in both developments, there is no a proved link with synthetic 
geometry, so — substantially — only algebraic conjectures are considered. We are not aware of a full 
algebraic method for geometry theorem proving formalized within a theorem prover. We are also not 

s http : //www.mmrc . iss . ac . cn/gex/ 

' http://www.mmrc . iss . ac . cn /mmsof t/ 

1( http : //www . j gex . net/ 

11 http://www-salsa.lip6.fr/~wang/GE0THER 

12 http : //www. theorema. org/ 

13 http : //www.matf .bg. ac . rs/~ janicic/gclc 
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aware of a formalized relationship between algebraic methods and synthetic geometry — i.e., of for- 
malized claim that if some geometry conjecture is proved by an algebraic method, then it is theorem of 
synthetic geometry. 

3 Algebraic Methods in Geometry 

Algebraic methods in geometry are well described in literature (9l (TTJ |42j |45j and many of its aspects 
are already considered to be folklore. Still, some issues deserve an attention, especially in the context of 
interactive theorem proving. In this section we give a brief account on algebraic methods in geometry, 
in order to make the paper self-contained, but also to stress some issues relevant for formalizing the link 
between synthetic geometry and the algebraic methods. 

3.1 Synthetic Euclidean Geometry and Analytic (Cartesian) Geometry 

Synthetic Euclidean geometry is geometry based on a, typically small, set of primitive notions and ax- 
ioms. There is a number of variants of axiom systems for Euclidean geometry and the most influential 
and important ones are Euclid's system (from his seminal "Elements") and its modern reincarnations ifTTl . 
Hilbert's system ll24ll . and Tarski's system ll39ll . Tarski's geometry is built in first-order logic and is less 
powerful than Hilbert's one (built in higher-order logic). 

In analytic (or Cartesian) geometry, points are represented as pairs of real numbers (Cartesian coor- 
dinates) and lines and curves as algebraic equations. All axioms of Euclidean geometry are valid (using 
the standard, natural interpretation) in Cartesian plane, so this plane is a model of Euclidean geometry 
and each Euclidean theorem is valid in Cartesian plane. It can be proved that the opposite also holds: 
each statement valid in Cartesian plane is a theorem of (any reasonably built) synthetical geometry. 

3.2 Algebraization of Geometry Statements 

Algebraic methods, used as methods for automated theorem proving in geometry for theorems of con- 
structive type (i.e., conjectures about geometric objects obtained by geometric constructions), introduce 
(symbolic) coordinates for geometric objects involved (points, and possibly lines), express geometric 
constructions and statements as algebraic (multivariate polynomial) equations involving introduced co- 
ordinates and then use algebraic means to prove that the statement follows from the construction. 

The standard algebrization procedure introduces fresh symbolic variables for point coordinates and 
introduces (polynomial) equations that characterize every construction step and the statement to be 
proved. Although for lines involved in the construction unknown coefficients could be introduced, the 
standard procedure avoids that and uses only points (while lines are specified only implicitly). Each con- 
struction starts from a set of free points and introduces dependent points along the way. In some cases, 
dependent points are chosen with a degree of freedom (e.g., choosing a random point on line). Each 
point gets a pair of coordinates represented by symbolic variables. Free variables are usually denoted by 
Uj (i = 0, 1,2, . . .), while the dependent ones are denoted by (i = 0, 1,2, . . .). If a point is free, both its 
coordinates will be free variables. If a point is, dependent, but with one degree of freedom, one coordi- 
nate will be a free, while the another one will be a dependent variable^ If a point is dependent both its 

14 However, choosing which one should be free and which one dependent is not trivial and requires special attention. For 
example, given that A is a point on a line I, one of its coordinates could be free while the other is dependent and calculated 
from the line constraints. However, if / is parallel to the x-axis, then the y coordinate of the point A cannot be free. Also, if I is 
parallel to the y-axis then the x coordinate of the point A cannot be free. 
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coordinates will be dependent ones. 

Geometry constraints over points can be formulated as algebraic constraints over the point coordi- 
nates (i.e., as polynomial equations over the introduced symbolic variables). For example, assume that 
symbolic coordinates of the point A are (x a ,y a ), the point B are (xb,y>b), and the point C are (x c ,y c ). The 
fact that A is the midpoint of the segment BC corresponds to the algebraic conditions 2x a = xy + x c 
and 2y a = yb + y c - The fact that A, B, and C are collinear corresponds to the algebraic condition 
(x a — Xb)(jb — yc) = (ja —yb)(xb — x c ). Similar connections are formulated for other basic geometry 
relationships (parallel lines, perpendicular lines, segment bisectors, etc.). 

Example 1 Let ABC be a triangle, and let B\ be the midpoint of the edge AC and C\ be the midpoint of 
the edge AB. Then, the midsegment B\C\ is parallel to BC. 



A(uq,u\) 




B(u2,m) C(u4,u$) 

Figure 1 : Midsegment theorem 

In this example, A, B, and C are free points so they are introduced symbolic variables A(uq,u\), 
B(u2,u$), and C(u4,us). Points B\ and C\ are dependent so they are introduced symbolic variables 
B\ (xo,xi ) and C\ (x 2 ,X3). Since B\ is the midpoint of AC, it holds that 2xo = uq + U4 and 2x\ = u\ + W5. 
Since C\ is the midpoint of AB, it holds that 2x2 = "0 + "2 and 2x3 = Mi + «3- These four equations 
come from the description of the construction, i.e., from the premises of the conjecture. In order to 
show that B\C\ is parallel to BC, it suffices to show that (x 2 —xq)(u5 — W3) = (X3 — X\)(u4 — U2) holds. 
This equation corresponds to the conclusion of the conjecture. So, the geometric problem is reduced 
to showing that every n-tuple satisfying the first four equations (stemming from the construction) also 
satisfies the last equation (stemming from the conclusion), i.e., to show that 

Vwo U\ U2U3 U4 Us Xo X\ X2 X3 G R. 2xo = "0 + "4 A 2xi = U\ + Us A 2X2 = UQ + U2 A 2X3 = «1 + «3 

(x 2 -X )(m 5 -M 3 ) = (X3-Xl)(w 4 -M 2 ). 

Note that in the above example, the condition that ABC is a triangle is not translated into conditions 
that A, B, C are pairwise different. Also, the condition that B\C\ is parallel to BC is represented by 
equation (x 2 — xq)(us — W3) = (X3 —x\)(u4 — w 2 ). However, this algebraic equation is actually equivalent 
to the following, weaker condition: B = C or B\ = Ci or B\^C\ is parallel to BC. Therefore, when 
translated back in geometry terms, the conjecture that is to be proved by an algebraic method is: 

Let B\ be the midpoint of the segment AC and C\ be the midpoint of the segment AB. Then, the 
segment B\C\ is parallel to BC or B is identical to C or B\ is identical to C\. 

Since, B\ ^ C\ follows from B ^ C, the above conjecture is equivalent with 

Let B and C be two distinct points, let B\ be the midpoint of the segment AC and C\ be the midpoint 
of the segment AB. Then, the segment B\C\ is parallel to BC. 

This example shows that translating a conjecture from geometry terms to algebraic terms and vice 
versa involves dealing with important details. In most systems, a hypothesis of the form AB\\CD is 
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typically represented by the equation of the form (x/, — x a )(yd — y c ) = {xd — x c )(yb —y a ) and moreover, 
this equation is used as a definition for AB\\ CD. However, this approach, unfortunately, breaks the link 
with synthetic geometry. 

It can be shown that most geometry properties are invariant under isometric transformations |[T9ll22ll . 
If P\ and P2 are two free points, there always exists an isometry (more precisely, a composition of a 
translation and a rotation) that maps Pi to the point (0,0) (i.e., the origin of the Cartesian plane) and P2 
to a point on the x-axis. Therefore, without loss of generality, it can be assumed that one free point has 
coordinates (0,0), while another one has coordinates (uq,0) or (0,mq) (although both these choices are 
correct, in some cases the choice could affect efficiency, or, if the partial Wu's method is used, the choice 
could even determine whether it will be possible to do the proof). This assumption can significantly 
reduce the amount of work needed by the algebraic methods. In addition, there are heuristics (aimed at 
improvement of efficiency) for choosing among the free points which two will get these distinguished 
coordinates. 

Example 2 Without loss of generality in the conjecture from Example [7J the points B and C can be 
assigned coordinates 6(0,0), andC(u4,0). Therefore, the algebraic conjecture to be provedis as follows: 

Vmo U\ U4 Xq X\ X2 X3 6 K. 

2xo = uq + uh A 2xi = u\ A 2x2 = «o A 2x3 = «i ==> (*2 — *o) -0 = (x3 — x\)u$. 
which is trivially valid {since x^—x\ = follows from 2x\ = u\ and 2x3 = u\ ). 

3.3 Algebraic Algorithms 

Once the geometric theorem has been algebrized, algebraic theorem proving methods themselves can be 
applied. Algebraic theorem provers use specific algorithms over polynomial systems (each polynomial 
equation of the form pi(vi, . . . ,v„) = P2(v\, . . . ,v„) is transformed to pi(v\, . . . ,v„) — P2(v\, . . . ,v„) = 0, 
i.e., to the form p(v\, . ..,v„) = 0). If f\, fk are polynomials obtained from the construction, and 
gi, . . ., gi are polynomials obtained from the statement, then the conjecture is reduced to checking if for 
each gt it holds that 

k 

Vvi,...,v„ Gi /\f(vi,...,v n ) =0 => g;(vi ...v„) =0. 
i=l 

As Tarski noted, this could be decided by a quantifier elimination procedure for the reals. In practice, 
it is hard to prove nontrivial geometric properties in this fashion, because even sophisticated algorithms 
for real quantifier elimination are relatively inefficient. Therefore, another approach is taken. The main 
insight, given by Wu Wen-tsiin in 1978, is that remarkably many geometrical theorems, when formulated 
as universal algebraic statements in terms of coordinates, are also true for all complex values of the 
"coordinates". So, instead of checking polynomials over reals, the field of complex numbers is used and 
the following conjecture is considered^] 

k 

Vvi,...,v„ € C. f\f(vi,...,v n ) =0 => g;(vi...v„) =0. (1) 

i=l 

This is true when g belongs to the radical of the ideal / = {fi,-..,fk), generated by the polynomials 
fi, i.e., when there exists an integer r and polynomials hi,...,^ such that g\ = L^hjf. Hilbert's 



Of course, this leads to the incompleteness (with respect to geometry) of the methods since, in some cases, the condition 
holds for R, while the methods fails to detect that due to some counterexamples from C. 
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Nullstellensatz theorem states that if the field is algebraically closed (as C is), then the converse is also 
true i.e., such decomposition can always be made. 

The two most significant algebraic methods use a kind of Euclidean division to check the validity of a 
conjecture of the form[T] Buchberger's method consists in transforming the generating set into a Grobner 
basis, in which a division algorithm can be efficiently used, while in the Wu's method a pseudo-division 
is used which closely mimics Euclidean division. 

The main operation over polynomials in Wu's method is pseudo division which, when applied to two 
polynomials p(v\ , . . . , v n ) and q(v\ , . . . , v n ) produces the decomposition 

c'p = tq + r, 

where c(y\, . . . ,v„_i) is the leading coefficient of q in the variable v n , r is the number of non-zero coef- 
ficients of p, f(vi, . . . ,v n ) is the pseudo-quotient, r(v\,. . . ,v„) is the pseudo-remainder, and the degree v n 
in r is smaller then in q. Since r = c' p — tq, it is clear that r belongs to the ideal generated by p and q. 

The first step of Wu's method (the simple form iTTTlD uses the pseudo-division operation to transform 
the construction polynomial system (A;=i fd to triangular form, i.e., to a system of equations where each 
successive equation introduces exactly one dependent variable. After that, the final reminder is calculated 
by pseudo dividing polynomial for statement (gi) by each polynomial from triangular system. 

Summarizing, Wu's method, in its simplest form, allows to compute some polynomials c, hi,...,hk 
and r such that 

k 

c 8i = hi fi + K 

1=1 

If the final remainder r is equal to zero, then the conjecture is considered to be proved. This simple 
method of Wu is not complete (in algebraic sense). A more complex and complete version of the method 
uses ascending chains which are considered in the Ritt-Wu principle. 

3.4 Non-degeneracy Conditions 

An important feature of algebraic theorem proving methods for geometry is that they automatically 
provide conditions — non-degeneracy or NDG conditions — under which the given statement holds. For 
example, once the decomposition cgi = Yli=i n ifi i n the simple Wu's method is obtained, if c{v\ , . . . , v n ) / 
it holds that /\f =1 /,(vi, ... ,v„) = =>■ g,(vi, . . . ,v„) = 0. However, if c(vi, ... ,v n ) = 0, then this 
conclusion cannot be made, so the conjecture is true only for those n-tuples for which a degeneracy 
(given by c) is not present. NDG conditions generated by algebraic methods are given in algebraic terms, 
but in the context of geometry theorem proving, it is important to have their geometrical interpretation 
as well. With NDG conditions given in the form of geometrical assumptions, one gets the final geometry 
theorem, avoiding some degenerate cases that make the original conjecture invalid. However, it still 
might be the case that the statement is valid under some weaker assumptions. In addition, a presence of 
NDG conditions doesn't necessarily mean that if they are not satisfied the conjecture is invalid. 

In Wu's method, NDG conditions are obtained from the triangular system as leading coefficients of 
variables introduced in each polynomial. These coefficients are actually x-polynomials over other vari- 
ables (introduced in triangular system in previous equations). In the Grobner basis method, NDG con- 
ditions are w-polynomials that are denominators of w-fractions which are coefficients of x-polynomials 
obtained during process of reduction to Grobner basis. 

Example 3 In a triangle ABC, let h a , hb, and h c be altitudes that correspond to the vertices A, B, and 
C and let H be the intersection ofh a and hj,. Then, H belongs to h c . The conjectures can be proved by 
algebraic methods in the following way. 
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5(0, «! ) 
\jH(xi,x 2 ) 



C{u 2 ,u 3 ) 



A(0,0) 



Figure 2: Orthocenter theorem 



Let the points A, B, C, and H be assigned coordinates (0,0), (mi,0), (u2,uj), (xi,X2)- The condition 
that AH is perpendicular with the line BC is represented by: {x\ — 0)(w2 — 0) + (xz — 0)(w3 — u\) = 
i.e., («3 — U\)x2 + U2X\ = 0. The condition that BH is perpendicular with the line AC is represented by: 
{x\ — 0)(«2 — 0) + (jc2 — mi)(«3 — 0) =0, i.e., M3X2 + U2X\ — U3U1 = 0. The statement of the conjecture 
corresponds to the condition X2 — U3 = 0. After the triangulation, the polynomial system becomes: 

— U2U\X\ + (— u\u\ + Ut,u\) = 0, 
(W3 —U\)X2 + U2X\ = 

After pseudo-division of X2 — M3 =0 with the above polynomials, the pseudo remainder is 0, which 
prove the conjecture, but only under conditions —U2U\ 7^ and u^—uy 7^ 0, i.e., u\ 7^ 0, U2 7^ 0, u\ 7^ M3, 
corresponding to geometry constraints that A ^ B, A, B and C are not collinear, and BC JL AB. 

It is interesting to notice that the computed NDGs can be weakened: the last two conditions can 
be omitted. Indeed, for all values that make the construction polynomials equal to zero, the statement 
polynomial is also equal to zero. Moreover, the geometry counterpart is also valid under these weakened 
NDG conditions, assuming that XY denotes some line containing both X and Y ( which is relevant in 
some degenerated cases). 



The subtle questions about NDG conditions in algebraic and geometry terms B2l can be rigorously 
analyzed within formalization of algebraic methods. 

As said, NDG conditions give sufficient (i.e., conditions under which calculations applied by Wu's 
method are allowed - they prevent pseudo division with the zero polynomial) and not necessary condi- 
tions for the (extended) conjecture to be valid. We illustrate this by the following example (Simpson's 
theorem). 



Example 4 Let ABC be a triangle and D is arbitrarily chosen point on circumscribed circle of this 
triangle. If K, L and M are respectively foot points of perpendicular lines through D to edges AB, BC 
and CA of triangle ABC, then these three points K, L and M are collinear. 

Let us assume that the following symbolic coordinates are associated to the introduced points: 
A(0,0), B(0,wi), C{u2,uj), D(u4,xi), M(0,xi), N(x2,x^), P(jc4,jcs). The construction corresponds to 
the following system of polynomial equations: 
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U2X1 — U2U\X\ + (W4W2 — W4M3 + U4U3U1 — U4U2) = 
(«3 — «l)jC3 + U2X2 + (— «3 +Ml)xi — W4W2 = 
W2X3 + (— «3 + Wl)*2 — M2«l = 
M3X5 + U2X4 — U3X1 — U4U2 = 

U2X5 — W3X4 = 



and the polynomial equation for theorem statement is X5X2 — X4X3 + X4X1 — X2X\ = 0. 

Following Wu's method, the following NDGs can be extracted: u\ 7^ 0, u% 7^ 0, us 7^ 0, u\ — lu^ux + 

l<2 + u\ 7^ 0, M3 7^ Ml, W3 + «2 7^ 0. 

77je /VDG condition u\ — 1u-$u\ -\-u\ + u\^0 can be interpreted as the following geometry condition: 
Line through points C and B is not perpendicular to line through points B and A, or in other words, the 
triangle ABC is not right triangle with right angle at vertex B. However, this is not a degenerate geometry 
case, but rather a condition required by the application of Wu 's method. Indeed, the theorem holds in this 
special case as well. It can be proved if the point C is constructed on the line through B perpendicular 
to line AB. Then, the points are instantiated as follows: A(0,0), B(0,U[), C{u2,u\), D(uj,x\), M(0,x\), 
N(u3,Ui), P(x2,xs) and the problem is transformed to following polynomial system: 

x\ — u\X\ + (M3 — M3M2) = 
W1X3 + U2X2 — u\X\ — U3U2 = 

U2X3 —U\X2=0 



and with polynomial for statement W3X3 + X2X1 — W1X2 — W3X1 = 0. 

The theorem is proved in this case, with the following two NDG conditions: u\ + u\ 7^ 0, u\ 7^ 0. 
which is equivalent to trivial NDG conditions u\ 7^ 0, U2 7^ 0. 

4 Formalization of Algebraic Methods within Isabelle/HOL 

An important part of our project is to develop a formalization of algebraic methods in Isabelle/HOLp^j 
Our goals are: 

• to identify and separate relevant geometry and algebra concepts and formalize them; 

• to build formally verified geometry axiomatizations and models; 

• to identify key shared features of different algebraic methods; 

• to implement algebraic GATPs that are formally verified, yet efficient enough to handle non-trivial 
geometric statements; 

• to integrate algebraic GATPs to Isabelle/HOL and facilitate their use in education and in formal 
explorations in geometry. 



16 The main authors are Filip Marie and Danijela Petrovic. The current proof documents are available at http: //argo . 
matf .bg. ac . rs 
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In this section we present first steps and some choices made towards this formalization. Although 
our final goals require establishing a connection between algebraic methods and synthetic geometries, at 
the current stage only the connection between algebraic methods and analytic geometry is established. 
However, many early choices in the formalization are made having in mind connections with synthetic 
geometries that will be analyzed in the further stages of our work, and some layers of abstraction are 
introduced to our formalization having in mind these goals. 

4.1 Representation of Geometry Conjectures 

Geometry conjectures (of constructive type) are represented by terms over an abstract syntax. To empha- 
size the constructive nature, along standard geometrical relations (e.g., collinearity, parallelness, perpen- 
dicularity), the syntax supports functions that construct new objects starting from the given ones (e.g., 
construct the midpoint of a segment, construct the intersection of two lines, construct a line trough a 
given point perpendicular to a given line). This makes the system close to the dynamic geometry tools 
like GCLC E6ll and GeoGebra [25] that naturally support this kind of constructions. 

Within Isabelle/HOL, the abstract syntax is defined by mutually recursive datatypes point_term 
(with constructors such as MkPoint, MkMidpoint, Mklntersection, etc.) corresponding to point con- 
structions, line_term (with constructors such as MkLine, MkPerp, MkParallel, etc.) corresponding to 
line constructions, and statement_term (with constructors like Collinear, Congruent, Parallel, 
Perp, etc.) corresponding to geometrical relations. In this syntax, for example, the statement that three 
perpendicular bisectors of edges of a triangle ABC meet in a single point can be represented by the 
following HOL term (the points A, B, C are free points): 

let A = MkPoint 1; B = MkPoint 2; C = MkPoint 3; 

CI = MkMidpoint A B; Al = MkMidpoint B C; Bl = MkMidpoint C A; 

D = Mklntersection (MkPerp (MKLine A B) CI) (MkPerp (MkLine B C) Al) in 

Incident D (MkPerp (MkLine C A) Bl) 

4.2 Interpreting Terms in Geometry Models 

Syntactic terms that represent geometry conjectures can be interpreted in different geometries (e.g., 
Cartesian geometry, Hilbert's geometry, Tarski's geometry). For convenience, Isabelle's locales infras- 
tructure is used to avoid repeating definitions. A locale AbstractGeometry is defined that contains 
primitive^] relations needed to interpret a geometric statement (e.g., incident, between, congruent, per- 
pendicular, parallel) and postulates their properties. Derived concepts are be defined only within this 
locale. For example, the (derived) notion of collinearity reduces to (primitive) notion of incidency — 
three points are said to be collinear iff there is a line that is incident with all three points. Various 
geometries interpret this locale and (abstract) definitions of derived notions transfer to these geometries. 

Semantics of a term (in abstract geometry) is given by functions point_interp, line_interp 
and statement_interp, that take a point_term, line_term or a statement_term and return a 
(abstract) point, a (abstract) line or a Boolean value, respectively. Since abstract interpretation of a term 
is uniquely determined only if the free points are fixed, all these functions take an additional argument 

17 Deciding whether some notion is primitive or derived depends on the underlying geometry. We chose to use a very high 
level abstraction (with more primitive notions) to separate the correctness concerns of algebraic methods with respect to analytic 
geometry from concerns about the internal connections of geometric notions. Separation of primitive and derived notions will 
be handled within formalisation of each specific geometry. 



Marie et al. 



73 



— a function mapping free point indices to points. Since in some degenerate cases it is not possible to 
assign a (unique) geometry object to a term (e.g., when interpreting a term that constructs intersection of 
two parallel lines) construction functions are partial — the interpretation assigns Isabelle/HOL's option 
type values to terms i.e., the term can be interpreted either by the value None (when no object could be 
used as interpretation of the term) or Some o (when the object o is the interpretation of the term). 

Statements are interpreted by the primitive relations of the abstract geometry, while constructions are 
reduced to primitive relations using the Hilbert's choice operator (e or SOME in Isabelle^ For example: 

statement_interp (Incident pi) fp = 

let pi = point_interp p fp; li = line_interp 1 fp in 
if (pi ^ None A pi ^ None) then 

incident (the pi) (the li) 
else 

None 

point_interp (Mklntersection 11 12) fp = 

let Hi = line_interp 11 fp; 12i = line_interp 12 fp in 

if (Hi 7^ None A 12i ^ None A 3 P. incident P (the Hi) A incident P (the 12i)) then 

Some (SOME P. incident P (the Hi) A incident P (the 12i)) 
else 

None 

A statement (represented by a term) is valid in a (abstract) geometry if all its interpretations (i.e., 
interpretations for all choices of free points) that are not trivially degenerate are true. 

definition (in AbstractGeometry) valid : : " statement _term =>■ bool" where 
"valid stmt = V fp. let si = statement_interp stmt fp in 
si =/= None — > si = Some True" 

All the previous notions lift to concrete geometries (e.g., Cartesian geometry, Hilbert's geometry, 
Tarski's geometry) once it is shown that these are interpretations of the AbstractGeometry locale. 

4.3 Formalizing Cartesian Geometry 

Cartesian geometry has been defined within Isabelle/HOL. Points are defined as pairs of real numbers. 
Lines are determined by their equations of the form Ax + By + C = 0, so a line is defined as an equivalence 
class of triplets (A,B,C) where A ^ V B / and triplets are equivalent iff they are proportional. This 
kind of reasoning has been facilitated by support for quotient types and quotient definitions that has been 
recently introduced to Isabelle/HOL BT1 . 

Once the types of points and lines are introduced, they are used to interpret the AbstractGeometry 
locale by introducing primitive geometric predicates and proving their properties (basically axiom-level 
statements). For example, the predicate incident is defined by a quotient definition, stating that a point 
(x,y) is incident to a line determined by the coefficients (A,B,C) iff Ax + By + c = 0, and for example, it 
is proved that there is a unique line incident to two different points. 

18 Note that this decision enables interpreting constructions that do not have unique interpretations (e.g., it is possible to 
construct an intersection of two identical lines, yielding an arbitrary point on those lines), while in the literature these kinds of 
constructions are sometimes considered degenerate. 
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4.4 Algebrizing Geometry Conjecture Terms 



We have implemented a recursive algebraization procedure (outlined in Section 3.2 1 within Isabelle/HOL. 
It keeps track of the current state containing symbolic coordinates of points introduced during the con- 
struction (these are subterms of the original statement term) and polynomials (both for construction and 
the statement) obtained so far. 

The function algebrize takes a statement term and returns sets of construction and statement poly- 
nomials obtained by our algebrization procedure, starting from an empty initial state. The procedure 
descends the term recursively and adds specific polynomials for each construction step or for the state- 
ment. For example, when algebrizing a statement of the form Colinear ABC, the set of statement 
polynomials includes the polynomial (x a —Xb)(yb —yc) — iya —yb){xb —x c ), where (x a ,y a ), (xb,yb), and 
(x c ,y c ) are symbolic coordinates introduced during algebrization of the constructions for point terms A, 
B, and C (during this, the set of construction polynomials was appropriately extended so that it contains 
definitions of these symbolic coordinates in term of symbolic coordinates of free points). 

The central theorem that we have formally proved is that, if all statement polynomials are equal to 
zero whenever all construction polynomials are (i.e., if all statement polynomials belong to the radical 
of ideal generated by the construction polynomials), then the statement is valid in analytic geometry. 



theorem "let (cp, sp) = algebrize term in 

(V ass. ( (V p G cp. eval_poly ass p = 0) — > p G sp. eval_poly ass p = 0)) 
AnalyticGeometry .valid s) " 



4.5 Algebraic Algorithms and Non-degeneracy Conditions 

Currently, for checking polynomial ideal membership problem we use the Grobner bases solver already 
integrated into Isabelle/HOL. It is a general purpose solver, not aimed at geometry, so it is not capable 
of finding the required NDG conditions. Therefore, the user can either specify NDGs or use the most 
general NDGs that can be inferred from the construction description. 



4.6 Further Work: Connections with Synthetic Euclidean Geometries 

Goals of the presented project also include formalizing the connections between algebraic methods and 
synthetic axiomatizations (established via Cartesian plane). We will consider only full Euclidean geom- 
etry (and not affme geometry). The first step would be to formalize Tarski's elementary geometry within 
Isabelle/HOL. Then, it would be necessary to prove that every valid statement in analytic geometry also 
hold in all models of Tarski's geometry. The central (most demanding) part would then be to formally 
prove the deductive completeness of Tarski's geometry (by formally establishing the connection with 
real-closed fields and formally showing that they allow quantifier-elimination). The next step would be 
to analyze the connections with Hilbert's geometry by showing that all proofs in Tarski's geometry can 
be transferred to Hilbert's geometry (since all Tarski's axioms can be proved within Hilbert's geometry). 
Finally, if an algebraic method is proved to be correct (in purely algebraic terms), then it would follow 
that it is correct when used for geometry theorem proving. 
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5 Implementation of Algebraic Methods in Java 

In this section we briefly describe our open-source, documented OpenGeoProver software implemented 
in Java[^]aimed at providing self-contained support for algebraic theorem proving methods for geometry 
and, also, for Wu's method and the Grobner bases method themselves. OpenGeoProver is an improved 
reimplementation of the algebraic methods implemented in C++ within the GCLC tool IT361 . Currently, 
only Wu's method is supported, but we are planning to implement the Grobner bases method as well. 
Currently, there is no implementation of B-reduction, the main operation in Grobner basis method, but 
there is a whole infrastructure for its implementation (like representation of ^-fractions, order among 
x-terms etc), so the remaining work is relatively small. 

OpenGeoProver consists of two main modules: one provides support for algebraic methods and the 
other is a set of APIs from different geometric applications and formats to the prover. OpenGeoProver 
produces detailed reports with steps that are taken in proving process: transformation of input geometric 
problem to algebraic form, invoking a specified algebraic-based theorem prover, and presenting the result 
with time and space spent to prove the theorem and with a list of NDG conditions obtained during proving 
process, transformed to a readable, geometry form. 

One of the main purposes of this Java implementation is integration with various dynamic geometry 
tools (including GeoGebra) that currently don't have support for proving geometry theorems. The archi- 
tecture of OpenGeoProver enables easy integration with other systems for interactive geometry and can 
be simply modified to accept various input formats for conjectures. 



5.1 Representation of Geometry Conjectures and Algebraization of Geometry Conjec- 
tures 

OpenGeoProver uses its internal structure to store information about the given conjecture. This structure 
is called CCP (Conjecture Construction Protocol) inspired by GeoGebra's Construction Protocol that 
contains the list of all constructions used in building the scene in GeoGebra. Likewise, CCP contains 
the list of all construction steps that build the scene for the conjecture and, in addition, it contains the 
goal that has to be proved. Also, CCP contains an algebraic form of the conjecture in the form of a set 
of polynomials. In addition, CCP stores information about NDG conditions for the conjecture, once they 
are returned from the algebraic theorem proving method. 

CCP is responsible for assigning symbolic coordinates to constructed points and for the process of 



algebraization of geometry conjectures. This processes are performed in the way outlined in Section 3.2 
constructed points are instantiated by pairs of symbolic variables that represent their coordinates, and 
polynomials are generated from conditions that some point belongs to some set of points (e.g. intersection 
point of line and circle belongs to that line and to that circle). 

Although it would be sufficient to provide support for only a few types of construction steps, for 
convenience CCP can deal with various types of constructions and statements, used in various geometry 
systems. Currently, it supports more than 20 types of constructions steps and 20 types of statements. 
Even if some construction isn't currently supported, it may be possible to express it by two or more 
existing construction steps. Also, it is possible to easily further extend these sets. 



The main author of the implementation is Ivan Petrovic. The source code is available at http : //argo . matf . bg . ac . rs 
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5.2 Algebraic Algorithms and Non-degeneracy Conditions 



Non-degeneracy conditions (in algebraic terms) are computed as outlined in Section 3.4 There are algo- 
rithms that transform algebraically specified conditions from polynomial form to its geometry counter- 
part (|4l|43l)- The idea for our, currently implemented, algorithm for transformation of NDG conditions 
is as follows: 

• all variables that appear in polynomial form of a NDG condition are extracted; 

• for each of these variables, all points from CCP whose some coordinate has been instantiated with 
it are extracted; 

• combinations of these points are generated, but taking care to cover all variables; 

• these combinations of points are extended by points with zero coordinates (since they are neutral 
— they could not introduce new variables to existing set of extracted variables); 

• for each combination, a number of possibilities is considered, depending on the number of points 
(e.g., for 3 points A, B and C we have the following specific positions: they are collinear, or C is 
on circle with center A and one point B, or line AB is perpendicular to AC, etc.); 

• for each specific possibility, its polynomial form is considered and checked whether it matches the 
input NDG polynomial. 

Sometimes, it is possible to get several positions among points that satisfy the same NDG polynomial, 
and we choose the one that includes free or random points, i.e. the one that can be easily adjusted to 
affect other constructed objects. 

Note that the above procedure is not complete and there are some NDG conditions that cannot be 
translated to the geometry form. 



5.3 Implementation Issues 

Algebraic primitives, like variables, powers, terms and polynomials, are key elements for algebraic theo- 
rem provers and it is essential to use data structures suitable for efficient algorithms. In OGP, polynomial 
terms, for example, are organized as sorted lists of powers of variables in descending order. This enables 
easy and efficient comparison of terms and searching for a power. Polynomials are organized as self- 
balanced binary search trees (i.e., objects of the java.util .TreeMap class) and this ensures efficient 
multiplying of polynomials which is one of critical operations in Wu's method. 

Geometry objects include geometry constructions, theorem statements and NDG conditions. Each 
geometry construction and each statement have their algebraic form which is used to transform the input 
problem to polynomials in order to apply the prover. All geometric constructions are stored in an indexed 
list and a construction map, all within CCP which contains input theorem. This enables simple access to a 
specific construction step either by its index (or by the label of constructed object which must be unique). 
NDG conditions initially have only polynomial form, and after the prover's completed execution they are 
transformed to relationships among geometry objects. To determine that relationship, OGP uses objects 
for theorem statements, which already describe relationships, and transforms them to the polynomial 
form to check if it matches the polynomial for the specific NDG condition. 



5.4 Experimental Results 

There are several implementations of automated geometry theorem provers, including provers based on 
algebraic methods but only the newest Java Geometry Expert (JGEX) is open-source and self-contained 



Marie et al. 



11 



Theorem 


JGEX 


OGP 


Example 6 (variant of; Pascal's thereom for circle) 


1077 


138 


Example 36 (butterfly) 


1745 


289 


Example 133 (variant of; orthocenter is incenter of triangle formed of altitudes' feet) 


7 


7 


Example 154 (incenter) 


38 


15 


Example 173 (orthocenter) 


9 


6 


Example 191 (Euler's line) 


10 


162 


Example 196 (Euler's/nine-point circle) 


9 


50 


Example 288 (Simson's theorem) 


25 


16 


Example 336 (Gergonne's point) 


58 


51 


Example 346 (Desargues' theorem) 


21 


221 



Table 1: Results of experimental comparison between OGP and JGEX (all execution times are given in 
milliseconds 



(does not require other systems or libraries). Because of that, we made a limited experimental comparison 
between OGP and JGEX (using Wu's method), as the most closely related tools (also both implemented 
in Java). Despite similarities, there are also differences between these two systems. In contrary to 
OGP, JGEX is developed for a specific geometry tool in mind that provides a graphical interface to the 
underlying provers and additional features such as proof visualization. Concerning the provers itself, 
OGP currently implements only the partial Wu's method, while JGEX implements the complete Wu's 
method (so, there are theorems that can be proved by JGEX but not by OGP). In JGEX, geometric 
constructions are transformed to polynomial form and triangulated one by one while objects are being 
constructed by the user in the drawing panel. When the user press the button to prove a theorem, only 
the final reminder has to be calculated. 

For experimental comparison between JGEX and OGP, we selected, rather arbitrarily, ten theorems 
in realm of both provers from Chou's Collection ifTTTl . We compared only the time of the final pseudo 
reminder calculation (due to the JGEX style of operation), which is basically the most time-consuming 
part of Wu's method. Results, given in Table[TJ of this limited experimental comparison, of course cannot 
be conclusive, but they suggest that OGP is comparable with JGEX (or even slightly more efficient). 

6 Future Integrations 

The components described above could and should be integrated with other tools and resources, as il- 
lustrated in Figure [3] A dynamic geometry tool can enable user to enter geometry constructions and 
conjectures or to import them from some repository (e.g., GeoThm^j). The user should also be able to 
export his/her construction to the repository, and this communication should be performed using some 
of the emerging standard interchange formats (e.g., i2g). The dynamic geometry tool can invoke an 
automated theorem prover on demand (to check if some conjecture is indeed a theorem), or to check if 
some construction is regular |[28l . In either case, a conjecture, in the form of conjecture construction 
protocol, is sent to Isabelle (if a verified answer is required) or to OpenGeoProver (if efficiency is more 
important than a verified answer). If invoked, our geometry prover in Isabelle calls OpenGeoProver (or 
some other algebraic tool) and asks for a witness certificate (e.g., a Grobner basis) that can be used for 
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verified proving of the given conjecture. Finally, the yes/no answer and NDGs are returned by Isabelle 
prover or OpenGeoProver back to DGS. This interaction can bring new features to existing DGS and 
other related systems. For instance, compared to GeoProof, the proposed system would also provide 
efficient Java provers, while, compared to GCLC, the proposed system would provide link to a proof 
assistant and verified reasoning. 



Geometry 
Repository 

i .interchange 
format 



certified yes/no answer 



DGS 



uncertified yes/no answer 



comtHictlon conjecture^fotocol 
set of polynomials 



IsaGeoProver 



witness certificate 



OpenGeoProver 



Figure 3: Integration of DGS, Isabelle geometry prover and OpenGeoProver 

The proposed system with described interaction can be relevant for new applications in education. 
However, its components can be also useful in formal explorations of geometry. Namely, the algebraic 
methods implemented as Isabelle tactics would be helpful for many intermediate tasks along building a 
formalized body of geometry knowledge. 



7 Applications in Education 

Currently, automated and interactive theorem proving is used in education (especially, in pre-university 
education) only very sporadically and in a very limited scope. Despite that fact that theorem proving 
can be very demanding for pupils/students to deal with (to use and understand), we believe that theo- 
rem proving can be used in a realistic and beneficial way. In the following we list some potentials for 
applications of the presented integrated framework, outlined in Section|6| in education. 

• Geometry has been in education widely used as a theory suitable for introducing the notions of 
axiom system and proof. Students are typically understand and exercise logical inferences on 
a concrete theories (e.g., Euclidean geometry) and on concrete examples. The same could and 
should be applied for introducing the concept of formal, machine verifiable proof to the students. 
Students of the modern, computer era should understand that proofs are strict, formal objects 
subject to verification by a computer, especially bearing in mind expectations that in some decades 
a significant portion of mathematics will be machine verifiable. In this context, as with "informal 
proofs", geometry is very suitable (because of its intuitive and visual nature). Having a setting 
in which only some relevant fragments have to be provided by the student (and the rest provided 
by already developed infrastructure), make this learning scenario realistic and useful, of course, if 
suitable adapted to specific cases. 
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• In mathematical education, mathematical knowledge requires justification and explanation for con- 
jectures. Algebraic methods in geometry, however, provide just the former one. Still, they (within 
a proof assistant or as a stand-alone implementations) can be used to help students for filling gaps 
in their proofs. This way, some gaps can turned to be based on wrong conjectures, or may reveal 
additional conditions needed for the conjecture to be valid. 

• With most of dynamic geometry tools, a student typically explores his/her construction, trying to 
visually or numerically detect or check that certain conjecture is always true. However, there is 
no mathematical argument in such conclusion and this may even lead to some misconceptions in 
student's understanding of geometry. Instead of this "visual checking", a theorem prover can be 
used for bringing precise, mathematical conclusions. This way, a DGS can still keep its role, but 
now enriched with strict mathematical arguments. In this scenario, either an algebraic method 
formalized within a proof assistant or (preferably) a light-weight, stand-alone automated theorem 
prover integrated within the DGS can be used. 

• In mathematical education, geometry construction problems are one of the standard (and, in the 
same time) most challenging types of problems: given some constraints, the student is asked to 
construct a geometry figure meeting these constraints. The construction is often accompanied by 
some kind of analysis, visualization and an informal proof. Still, these proofs often tend to be 
flawed, most often due to degenerate cases. In this scenario, the student could used an algebraic 
method for checking his/her construction, before even attempting to prove that it is correct. If it is, 
the student have to prove it in a way that is expected from him/her. 

We stress that our presented framework currently does not provide an interface needed for applica- 
tions in education, but give all key infrastructure. 

8 Conclusions 

In this paper we presented our ongoing project on implementation and formalization of algebraic methods 
for geometry theorem proving. The project should hopefully help wider applications of the algebraic 
methods in education and formalization of mathematics. 
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